PyDigger - unearthing stuff about Python


Name Version Summary Date
kiwi-pdf-chunker 0.3.3 A tool for parsing PDF document layouts and chunking content 2025-10-08 10:31:42
echogem 1.0.0 Intelligent Transcript Processing and Question Answering Library 2025-09-01 04:34:11
chunklet 1.4.0 A smart multilingual text chunker for LLMs, RAG, and beyond. 2025-08-28 05:06:01
chunklet-py 1.4.0 A smart multilingual text chunker for LLMs, RAG, and beyond. 2025-08-28 05:05:02
treesitter-chunker 2.0.0 Semantic code chunker using Tree-sitter for intelligent code analysis 2025-08-21 02:13:49
semchunk 3.2.3 A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks. 2025-08-13 03:31:59
rag-document-viewer 1.1.1 RAG Document Viewer 2025-08-11 16:21:26
chunkwrap 2.4.1 Command-line tool to select code/docs for LLMs with secret masking and a hard cap on final output size. 2025-08-11 11:37:09
smartchunkllm 0.1.7 Advanced Legal Document Semantic Chunking System 2025-08-10 21:52:24
chunkipy 1.0.0.post1 Chunkipy is an easy-to-use library for chunking text based on the size estimator function you provide. 2025-08-08 12:37:03
llama-index-packs-node-parser-semantic-chunking 0.4.0 llama-index packs node_parser integration 2025-07-30 21:33:16
docling-analysis-framework 1.1.0 AI-ready analysis framework for PDF and Office documents using Docling for content extraction 2025-07-29 14:34:10
llm-text-splitter 0.2.0 A lightweight, rule-based text splitter for LLM context window management, handles multiple file formats and enriches chunks with metadata. 2025-07-24 12:21:01
llm-agent-toolkit 0.0.32.8 LLM Agent Toolkit provides minimal, modular interfaces for core components in LLM-based applications. 2025-04-21 07:43:48
ai-chunking 0.1.4 A powerful Python library for semantic document chunking and enrichment using AI 2025-03-16 20:44:19
betterhtmlchunking 0.9.1 A Python library for intelligent HTML segmentation and ROI extraction. It builds a DOM tree from raw HTML and extracts content-rich regions for efficient web scraping and analysis. 2025-02-14 08:21:28
alphacodings 0.2.0 base26 ([A-Z]) and base52 ([A-Za-z]) encodings 2024-12-09 03:04:43
quackling 0.4.1 Quackling enables document-native generative AI applications 2024-09-11 13:26:57
llama-index-readers-preprocess 0.2.0 llama-index readers preprocess integration 2024-08-22 06:50:57
pypreprocess 1.4.3 Preprocess SDK 2024-08-11 08:00:57